Noise Thresholds for Spectral Clustering

Authors

  • Sivaraman Balakrishnan
  • Min Xu
  • Akshay Krishnamurthy
  • Aarti Singh
Abstract

Although spectral clustering has enjoyed considerable empirical success in machine learning, its theoretical properties are not yet fully developed. We analyze the performance of a spectral algorithm for hierarchical clustering and show that, on a class of hierarchically structured similarity matrices, this algorithm can tolerate noise that grows with the number of data points while still perfectly recovering the hierarchical clusters with high probability. We additionally improve upon previous results for k-way spectral clustering to derive conditions under which spectral clustering makes no mistakes. Further, using minimax analysis, we derive tight upper and lower bounds for the clustering problem and compare the performance of spectral clustering to these information-theoretic limits. We also present experiments on simulated and real-world data illustrating our results.
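As a rough illustration of the setting the abstract describes, the sketch below builds a two-cluster similarity matrix, perturbs it with symmetric Gaussian noise, and splits the points by the sign of the second-smallest eigenvector of the unnormalized graph Laplacian. This is a generic spectral bipartition, not necessarily the authors' exact algorithm; the function names, similarity values, noise level, and cluster sizes are all assumptions chosen for the example.

```python
import numpy as np


def spectral_bipartition(W):
    """Split items into two groups by the sign of the Fiedler vector.

    W is a symmetric similarity matrix. This is a generic sketch, not
    the specific procedure analyzed in the paper.
    """
    degrees = W.sum(axis=1)
    laplacian = np.diag(degrees) - W  # unnormalized graph Laplacian
    # eigh returns eigenvalues in ascending order for symmetric input
    _, eigvecs = np.linalg.eigh(laplacian)
    fiedler = eigvecs[:, 1]  # eigenvector of the second-smallest eigenvalue
    return fiedler >= 0


def noisy_block_similarity(n_per_cluster, within, between, sigma, rng):
    """Two-block similarity matrix plus symmetric Gaussian noise.

    All parameter values are hypothetical, picked for illustration.
    """
    n = 2 * n_per_cluster
    labels = np.repeat([0, 1], n_per_cluster)
    same = labels[:, None] == labels[None, :]
    W = np.where(same, within, between).astype(float)
    noise = np.triu(rng.normal(0.0, sigma, size=(n, n)), 1)
    W = W + noise + noise.T  # keep the matrix symmetric
    np.fill_diagonal(W, 0.0)
    return W, labels


rng = np.random.default_rng(0)
W, labels = noisy_block_similarity(50, within=1.0, between=0.2, sigma=0.2, rng=rng)
pred = spectral_bipartition(W)

# Cluster labels are only defined up to swapping the two groups.
agreement = max(np.mean(pred == labels), np.mean(pred != labels))
print(f"fraction of points assigned consistently: {agreement:.2f}")
```

Applying the same split recursively to each recovered group would give a binary cluster tree, which is one simple way to read "hierarchical" here; raising `sigma` lets one probe informally how much noise the split tolerates before it starts making mistakes.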

Similar resources

Assessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories

In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...

Agglomerative hierarchical kernel spectral clustering for large scale networks

We propose an agglomerative hierarchical kernel spectral clustering (AH-KSC) model for large scale complex networks. The kernel spectral clustering (KSC) method uses a primal-dual framework to build a model on a subgraph of the network. We exploit the structure of the projections in the eigenspace to automatically identify a set of distance thresholds. These thresholds lead to the different lev...

Sparse unmixing of hyperspectral images using a pruned spectral library

Spectral unmixing of hyperspectral images is one of the most important research fields in remote sensing. Recently, the direct use of spectral libraries in spectral unmixing has been on the increase. In this approach, which is called sparse unmixing, we need neither an endmember extraction algorithm nor a priori determination of the number of endmembers. Since spectral libraries usually contain highly correlated s...

Perceptual musical noise reduction using critical bands tonality coefficients and masking thresholds

Speech enhancement techniques using spectral subtraction have the drawback of generating an annoying musical noise. We develop a new post-processing method for reducing it in each critical-band. In the proposed technique, the difference between tonality coefficients of the noisy speech and the denoised one constitutes one step for detection. Next, using a modified Johnston masking threshold, we...

Spectral Clustering for Robust Motion Segmentation

In this paper, we propose a robust motion segmentation method based on matrix factorization and subspace separation. We first prove mathematically that the shape interaction matrix can be derived using QR decomposition rather than Singular Value Decomposition (SVD). Using the shape interaction matrix, we solve the motion segmentation problem using a spectral graph clustering technique. We exploi...

Publication date: 2011